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Histo-blood group antigens (HBGAs) are a source of antigenic variation between individuals that modulates 
resistance and susceptibility to pathogens and is a barrier to the spread of enveloped viruses. HBGAs are also 
produced by a few prokaryotes where they are synthesized by glycosyltransferases (GTs) related to human 
HBGA synthases. Here we report the first structure of a bacterial GT of this family, from an intestinal 
resident, Bacteroides ovatus. Unlike its mammalian homologues and other GTs with similar folds, this 
protein lacks a metal-binding Asp-X- Asp motif and is fully active in the absence of divalent metal ions, yet is 
strikingly similar in structure and in its interactions with substrates to structurally characterized 
mammalian metal-dependent mammalian homologues. This shows how an apparently major divergence in 
catalytic properties can be accommodated by minor structural adjustments and illustrates the structural 
underpinnings of horizontal transfer of a functional gene from prokaryotes to vertebrates. 

Mammalian genomes encode five GTs from Carbohydrate Active Enzymes (CAZy) family 6, a group of 
enzymes that catalyze the transfer of a-galactose (a-Gal) or a-N-acetyl-galactosamine (a-GalNAc) to 
the 3-OH group of a P-linked Gal or GalNAc in an acceptor substrate. These GTs include the histo-blood 
group (HBG) A and B synthases (known as GTA and GTB), an a-l,3-galactosyltransferase (oc3GT) that catalyzes 
the synthesis of the xenoantigen or a-gal epitope, isogloboside 3 synthase (iGb3S) and Forssman glycolipid 
synthase (FS). oc3GT, iGb3S and FS are inactive in humans because of missense and frame-shift mutations in 
their genes, while allelic variation in blood group genes results the production of various combinations of O, A 
and B antigens (HBGA) in different individuals. The lack of active a3GT and iGb3S results in the absence of oc-Gal 
epitopes from human tissues, and presence of antibodies (about 1 -3 % of the IgG) against the a-Gal epitope in the 
circulations of all humans while the lack of A and/or B antigen in individuals is associated with the presence of 
antibodies against the absent antigen. The production of antibodies against glycans produced by family 6 GTs is 
thought to result from exposure to these antigens on bacteria and enveloped viruses (reviewed by Brew et al 1 ). 

Vertebrate GT6 members are multi-domain membrane proteins with small N-terminal cytosolic domains, a 
transmembrane helix, a stem and a C-terminal catalytic domain. Their catalytic activities are completely depend- 
ent on divalent metal ions, particularly Mn 2+ . Crystallographic structures have been determined for the catalytic 
domains of human GTA and GTB 2 5 and bovine a3GT in free states and in complexes with substrates and 
inhibitors 69 ; structure-function relationships in these enzymes have also been investigated by extensive muta- 
tional studies 1012 . GT6 members have a GT-A fold, one of the two predominant folds among GTs (GT-A and GT- 
B) and share, with most other GT-A fold GTs, a Asp-X-Asp (D-X-D) sequence motif that interacts with the metal 
cofactor and donor substrate (UDP-GalNAc or UDP-Gal). 

BoGT6a is one of two GT6 paralogs encoded by the B. ovatus genome and is a representative of the GT6s that 
are found in at least 25 other Gram-negative bacterial species from four phyla (Firmicutes, Deferribacteres, 
Proteobacteria and Bacteroidetes) and genes from various unidentified bacterial species in the human gut 
metagenome. The bacterial GT6 have a N-terminal catalytic domain and a C-terminal membrane-association 
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basic-hydrophobic-basic (BHB) domain 1 suggesting that they assoc- 
iate with the bacterial cell membrane to facilitate their roles in the 
synthesis of the lipopolysaccharide O-antigen 13 . We have previously 
expressed a C-terminally truncated form of BoGT6a, lacking the 
membrane-binding region that is less prone to aggregation than 
the full-length protein 14 . Enzymatic characterization of this form of 
BoGT6a showed that it catalyzes the transfer of GalNAc from UDP- 
GalNAc to 2'-fucosyllactose (FAL) and similar molecules. Most sig- 
nificantly, BoGT6a was found to be fully active in the presence of 
EDTA and not stimulated by Mn 2+ or Mg 2+14 . Since its homologues 
from other bacteria, with one exception, also have an Asn-X- Asn (N- 
X-N) sequence in place of the metal-binding D-X-D motif, it seems 
likely that these are also metal-independent. The exception is the 
GT6 from a member of the phylum Chlamydiae, Parachlamydia 
acanthamoebae, which has a D-X-D motif, lacks a BHB domain 
and is similar in size and sequence to a GT6 from a bacteriophage, 
the cyanophage PSSM-2. Numerous genes that encode GT6 homo- 
logues that resemble PSSM-2 and appear to be from other bacterio- 
phages or marine bacteria are present in the Marine Metagenome 
database. 

Here we present the crystal structure of the C-terminally truncated 
form of the metal-independent BoGT6a at high resolution and in a 
complex with an acceptor substrate 2'-fucosyllactose (FAL). The 
structure exhibits remarkable similarity to the catalytic domains of 
human GTA and GTB. Also, we have used isothermal calorimetry 
(ITC) to investigate the binding of the donor and substrates, UDP- 
GalNAc and FAL, respectively, and have developed a model of the 
complex of UDP-GalNAc with BoGT6a. P. acanthamoebae GT6 was 
expressed and found to have a similar substrate specificity to BoGT6a 
but requires divalent metal ions for activity. Finally, based on our 
experimental data, we discuss the role of the metal ion in members of 
the GT6 family and the evolutionary significance of metal depend- 
ence and independence. 

Results 

Overall structure of apo BoGT6a. The structure of native apo- 
BoGT6a encompasses 10 [3-strands, 4 a-helices and three 3jq 
helices calculated using STRIDE 15 . The overall topology of BoGT6a 
is strikingly similar to those of characterized mammalian members of 
GT6 family with sequence similarity of 49% to the catalytic domains 
of ABO(H) blood group enzymes - GTA, GTB and oc3GT (Fig. 1). It 
is truncated at the N-terminus relative to the catalytic domains of the 
mammalian enzymes by about 46 residues, a region that has been 
thought to be necessary for correct folding of oc3GT 16 . The native 
structure of BoGT6a has a disordered region (residues 126-151) that 
could not be modeled because of lack of visible electron density. 
However, in the substrate-bound form (BoGT6a-FAL), electron 



density for this region is clearly visible and its structure is 
stabilized by inter- and intra-molecular interactions that were not 
observed in the native structure (discussed below). The root mean 
square deviation (r.m.s.d. calculated using TMALIGN 17 ) for Ca 
atoms and all atoms between native BoGT6a and the four mole- 
cules (A, B, C and D) of the BoGT6a-FAL complex are 0.77-0.87 
and 0.99-1.3 A respectively. The flexible C-terminal segment beyond 
residue Ile228 undergoes a significant conformational change on 
binding the acceptor when compared with the native structure 
(Fig. 2a). Similarly there are conformational rearrangements of 
residues 66-67 and 175-190 associated with FAL binding. For 
further discussion, these regions are designated Loopl (126-151), 
Loop2 (175-190), Loop3 (66-67) and C-term (228-236) (Figs. 2a, 
Fig. SI). 

The native structure of BoGT6a showed the presence of three ions 
(presumably from the crystallization buffer), a Ca 2+ ion located close 
to the C-terminus of the protein, interacting with Glu216, Asp230 
and Asn232, a Cl~ ion bound by a group of solvent molecules in a 
loop between Met29 and Phe34. The ions are distant from the N-X-N 
motif (—19 and 14 A for Cl~ and Ca 2+ ions respectively). In addition, 
one bound HEPES molecule and some 200 water molecules were 
identified in the structure (Table 1). 

Comparison of BoGT6a with other members of the GT6 family. 

BoGT6a is closely similar in structure to GTA (PDB id: 1ZI1 18 ), GTB 
(PDB id: 1R7U 19 ) and oc3GT (PDB id: 1GX4 6 ) (Fig. 1) with overall 
r.m.s.d 17 values of 1.96 (179 Ca atoms), 1.96 (180 Ca atoms) and 
2.15 A (200 Ca atoms) respectively. At the sequence level BoGT6a is 
49%, 48% and 48% similar to GTA, GTB and a3GT respectively. The 
four molecules of the BoGT6a-FAL structure superimpose on GTA 
and GTB with an overall r.m.s.d. value of 1.6 A and with a3GT, the 
r.m.s.d. values range from 1.75 to 2.0 A. The principle differences in 
r.m.s.d. values between BoGT6a and the BoGT6a-FAL complex and 
their mammalian homologues are mostly attributable to the 
conformational changes in Loop2, Loop3, and the C-terminus of 
the molecule. 

Conformational changes associated with acceptor (FAL) binding 
and implications for acceptor specificity. In the complex of 
BoGT6a with FAL, the acceptor substrate binds in a pocket 
(Figs. 2b-c) and interacts with residues in the C-terminal half of 
the polypeptide chain. The complex is highly ordered, as indicated 
by the clear electron density for the substrate (Fig. 2d), and includes a 
major conformational change in the C-terminal region of BoGT6a 
(compared with the native), which adopts a 'partially closed' 
conformation (Fig. 2e). The acceptor molecule interacts with the 
protein through a network of H-bonds with Hisl22, Thrl34 and 




Figure 1 | Cartoon representation of secondary structural elements of (a) BoGT6a (present study), (b) GTA (PDB id: 1ZI1) and c) a3GT (PDB id: 
1GX4). N- and C- terminus of the protein molecules are labelled. The N-terminal extension in GTA and a3GT are marked. The manganese ion in GTA 
and a3GT are shown as purple spheres. 
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Figure 2 | (a) C" superposition ofBoGT6a and BoGT6a-FAL structures.Structural variations in loop regions Loopl (126-151) in red, Loop2 (175-190) in 
forest green, Loop3 (66-67) in sand, C-term (228-234/236) in purple of BoGT6a-FAL are shown. BoGT6a loops and C-term are coloured in marine blue; 
(b) Surface potential charge representation of BoGT6a with the bound FAL and modelled UDP-GalNAc molecule; (c) Chemical structure of FAL. Only 
the 1st carbon atom positions are labelled for each of the monomeric unit; (d) Observed electron density [(2Fo-Fc) map contoured at 1.0a] for the bound 
FAL in the structure of the complex; (e) Acceptor binding site of BoGT6a with interacting residues and ligand FAL (2FAL) shown as ball-and-stick model. 
The position of Trpl89 both in native BoGT6a (in yellow) and BoGT6a-FAL complex (in green) is shown and labeled in red (BoGT6a) and in black 
(BoGT6a-2'-fucosyllactose). The reorientation of this loop (Loop2) stabilizes Loopl that shares hydrogen bonding interactions with the bound FAL. 
Residues interacting with 2 ' -fucosyllactose from molecule B are coloured in grey. 



Glul92 (from molecule A) and Lysl28 and Glul32 (from molecule 
B) (Fig. 2e, Table 2). Corresponding residues from molecules C and 
D are also involved with FAL binding but display subtle variations in 
their interactions. This indicates that the interaction of molecule B 
with the ligand bound to molecule A (and molecule C and D) of 
BoGT6a is a crystallographic artifact. Hydrophobic interactions, 
particularly with the aromatic side chains of Phel25, Trpl89 and 
Trp218, are also important for FAL binding. The amino acid 
sequence alignment in Fig. SI shows the locations of the flexible 
loops and residues that interact with substrates and highlights the 
high levels of sequence identity between the prokaryotic and 
mammalian GT6. Several residues that interact with FAL are 
conserved between BoGT6a and its closest mammalian homo- 
logues GTA and GTB including Hisl22, Thrl34, Tyrl53, Trpl89 
and Glul92 (Fig. SI and Fig. 3). In a3GT which, unlike GTA/B 
and BoGT6a, binds acceptor substrates that lack a 2'-fucosyl 
moiety, there are substitutions for acceptor-binding residues that 
can be linked to its specificity: His 122 is substituted by Gin and 
both Gly 124 and He 228 by Trp (residues marked as Al, A2 and 
A7 in Fig. SI). Trp249 (A2) in oc3GT interacts with the non-polar B- 
face of glucose in lactose, undergoing a large conformational change 
to accommodate the acceptor and being stabilized in its new location 
by a H-bond with Asp340. A similar conformational change is not 
required in either BoGT6a or GTA/B as the corresponding residue is 
Gly in BoGT6a and GTA, and Ser in GTB (Fig. SI and Fig. 3a-c). The 
residue at position A7 is also a key determinant of acceptor substrate 



specificity; the smaller side chains in GTA/B and BoGT6a (Ala 
and He, respectively) allow the accommodation of the 2'-fucose of 
the acceptor substrate (Fig. 3a-c) but Trp356 of a3GT does not 
(Fig. 3d). 

Glul92 (marked A5 in Fig. SI) forms H-bonds with FAL and is 
projected to also H-bond with the donor substrate. It is conserved in 
all known members of the GT6 family and has been shown to have a 
key role in catalysis in both metal- dependent and metal-independent 
GT6. Previous studies have shown that substitution of Glul92 by Gin 
in BoGT6a results in a 2 X 10 4 reduction in fc cat for GalNAc transfer 
to FAL 14 . The highly conserved Trp 189 (Fig. SI), interacts with the 
acceptor substrate in all structurally characterized GT6 members 
(Fig. 3). 

Overall, on binding the acceptor substrate, Loop 1 is stabilized by at 
least three hydrogen bonds through residues Lysl28 and Thrl34. 
Loop2 undergoes a major shift, affecting Trpl89, that forms a stack- 
ing interaction with the bound FAL. 

The C-terminal region of BoGT6a, like the C-terminus of a3GT 
undergoes a major conformational change (between open and closed 
conformations) upon ligand binding 9 . Lys231 of the C-terminal region 
of BoGT6a in one of the four molecules in the asymmetric unit inter- 
acts with the fucose moiety of FAL through hydrogen bonds. The 
equivalent residue in tx3GT is Lys359, which directly interacts with 
the phosphate moiety of the bound product, UDP (Fig. S2), in the 
closed state and through a water mediated hydrogen bonding inter- 
action with the acceptor substrate, N-acetyllactosamine 6 . Mutational 
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and structural studies have shown that Lys359 and Arg365 of oc3GT 
play an important role in catalysis 6 '" and Lys231 of BoGT6a is also 
functionally important since the Lys231Ala mutation in BoGT6a 
generates a >25-fold reduction in /c cat . 

Donor substrate (UDP-GalNAc) interactions. We have unsuc- 
cessfully attempted to grow crystals of BoGT6a and its low-activity 
Glul92Gln mutant in a complex with UDP-GalNAc. However, we 
were able to investigate the binding of the substrates, UDP-GalNAc 
and FAL by isothermal titration calorimetry (ITC) in HEPES buffer, 
pH 7.5 containing 1 mM DTT and 0.2 M NaCl. The heat released 
was measured as aliquots of ligand were added successively to 
solutions of the enzyme (1.433 ml, 13 uM) in the stirred reaction 
cell at 25°C. A binding isotherm was generated by plotting the 



integral of the heat of binding against the concentration of ligand 
which can be analyzed as described to calculate the association 
constant (K a ), enthalpy of binding (AH obs ), stoichiometry of 
binding (N) and entropy of binding (AS), assuming a 1:1 
stoichiometry of binding (see Methods for discussion and details). 

The ITC studies with wild-type BoGT6a show that it binds UDP- 
GalNAc with high affinity in the absence of metal ions and H-antigen 
(Table 3, Fig. S3). Analysis of the ITC data, with the assumption of 
1 : 1 stoichiometry, gives a K A value for UDP-GalNAc (59 uM) in 
good agreement with the Ki value determined by steady state kinetics 
(67 uM) supporting the basis of this analysis. The results also show 
that the binding of UDP-GalNAc is enthalpy- driven with a highly 
favourable (negative) enthalpy change ( — 23.5 kcal/mol) partially 
balanced by an unfavourable entropy change ( — TAS of 17.7 kcal/ 
mol). BoGT6a catalyzes UDP-GalNAc hydrolysis at low rate (<0.1% 
of transferase activity 14 ), which could contribute to the measured 
heat of UDP-GalNAc binding. To determine if this could affect the 
thermodynamic profile for this interaction, we carried out an ITC 
study of UDP-GalNAc binding to a very low activity mutant of 
BoGT6a (E192Q 14 ). Although the (negative) enthalpy of binding 
was less (by about 7 kcal/mol), there was a similar reduction in the 
negative entropy contribution to binding, resulting in an essentially 
unchanged K a and AG of binding. This indicates that UDP-GalNAc 
hydrolysis does not affect the general conclusions regarding UDP- 
GalNAc binding. ITC studies of FAL binding to wild-type BoGT6a 
indicated weak, low affinity binding. However, the addition of 3 mM 
UDP in the protein and ligand solutions ( 1 1 -fold greater than the K<j) 
resulted in a stronger signal and affinity, reflected in a K d of 77 uM 
(AG= — 5.6 kcal/mol) and a negative enthalpy change of — 15.4 kcal/ 
mol partially compensated by an unfavorable entropy of binding 
( — TAS of 9.8 kcal/mol). This suggests that the binding of UDP 
may increase the strength of FAL binding either by inducing a con- 
formational change or by direct interactions of the (3 phosphate O- of 



Table 2 | Hydrogen bonding interactions between BoGT6a and 
FAL. The distances given are the range observed in all four mole- 
cules of BoGT6a FAL. Protein atom labels italicised are from a 



symmetry related molecule. Atom labels are as shown in Figure 2 
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Figure 3 | Acceptor binding pocket among GT6 family members showing the conserved residues and interactions, (a) BoGT6a in complex with FAL; 
(b) GTA in complex with H-antigen (PDB id: 1LZI); (c) GTB in complex with H-antigen (PDB id: 1LZJ); (d) a3GT in complex with LacNAc (PDB id: 
1GX4). Water molecules are shown as cyan spheres. 



the bound UDP with the acceptor 3 -OH as in «3GT 21 . Overall, ITC 
studies indicate that the binding of donor and acceptor substrates to 
BoGT6a is not ordered, but appears to be synergistic since binding of 
a donor substrate analog enhances the binding of acceptor. Along 
with kinetic studies that indicate a sequential mechanism (both sub- 
strates bind prior to catalysis) this suggests (but does not prove con- 
clusively) that BoGT6a may have a random sequential mechanism 14 . 

Models of complexes of UDP-Gal and UDP-GalNAc with 
BoGT6a were generated by molecular docking of UDP-Gal and 
UDP-GalNAc using a3GT in complex with UDP (PDB: 1GX4 6 ), 
UDP-Gal (PDB: 2VS5 11 ), as templates and UDP-GlcNAc extracted 
from its complex with MurG (PDB: 1NLM 22 ). Models of BoGT6a 
docked to UDP-Gal and UDP-GalNAc were energy minimized using 
the CNS software suite 23 ; the modeled complexes with UDP- GalNAc 
and UDP-Gal shows that the mode of donor substrate binding in 
BoGT6a is closely similar to those of a3GT and GTA/GTB 
(Figs. 4,5), emphasizing the high level of conservation of donor 
substrate binding site, and active site residues in general between 
the bacterial and mammalian GT6 family members. The residues 



are shown in ball-and-stick representation in Figure 5 and exhibit 
H-bonding interactions to the donor substrate through their main- 
chain or side-chain atoms (Table 4). Mutational studies with 
BoGT6a have shown that residues A 155 GGL 158 (Fig. 4a) are critical 
for its specificity for UDP-GalNAc vs UDP-Gal 14 . The corresponding 
residues in a3GT (Fig. 4c) are H 280 AA1 283 and screening combinator- 
ial libraries with mutations in this region identified a mutant of cx3GT 
with the sequence A 280 GGL 283 that has greatly enhanced GalNAc 
transferase activity 10 . The two substitutions in the corresponding 
region between GTA and GTB are responsible for their respective 
specificities for UDP-GalNAc and UDP-Gal 24 . Other residues of 
BoGT6a: Ile8, ThrlO, Thr70, Arg73, Asn95, Hisl90 and Aspl91 
appear to form H-bonds with the UDP-GalNAc. Of these residues 
the main chain atom O of Ile8, the main chain and side chain atoms 
of ThrlO and side chain atoms OD1 and ND2 of Asn95 directly H- 
bond with the UDP moiety (Fig. 4a). Residues Thr70, Arg73, His 190 
and Asp 191 interact through multiple H-bonds with the GalNAc or 
Gal moiety of the donor substrate. Asn95 interacts with the ribose 
and phosphate moiety of the UDP molecule. Mutation of Asn95 of 



Table 3 Thermodynamic parameters for the binding of substrates and substrate analogues to BoGT6a. Titrations 
(298 K) with 1 3 uM BoGT6a in 20 mM HEPES buffer pH 7.5 containing 0.2 M NaCl and 1 mM DTT 

Ligand UDP-GalNAc El 92Q variant UDP-GalNAc WT FAL 


were conducted at 25°C 
FAL 


Other ligands 








UDP (3 mM) 


Ka(*10- 3 ) 


18.0±0.4 


17.8±0.5 


0.81 ±0.02 


13.02±0.24 


AG (kcal/mol) 


-5.78±0.02 


-5.80±0.02 


-3.97±0.03 


-5.61 ±0.01 


AH (kcal/mol) 


-16.4±0.2 


-23.5±0.3 


-13.5±0.3 


-15.4±0.1 


TAS (kcal/mol) 


-10.6±0.2 


-17.7±0.3 


-9.6±0.7 


-9.8±0.1 
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BoGT6a to Asp has been shown to result in a 36-fold increase in the 
K m for UDP-GalNAc and a >4000-fold decrease in k cM whereas the 
substitution of Asp for the Asn97, has only minor effects on the K m 
and fc cat 14 . 

GT6 from parachlamydia acanthamoebae, halls coccus (PaGT6). 

Based on structures and molecular phylogeny analyses the GT6 
family form three clades (1) prokaryotic enzymes with a minimal 
catalytic domain and a D-X-D motif represented by the cyanophage 
PSSM-2 enzyme, its Marine Metagenome homologues and PaGT6 
(2) the multi-domain vertebrate enzymes and (3) the bacterial 
enzymes with C-terminal membrane binding BHB domains and 
N-X-N replacing D-X-D. An array of evidence supports the notion 
that the evolution of the GT6 family included horizontal gene 
transfer (HGT) between prokaryotes and an ancestor of the 
vertebrates 1 , raising the question about the route(s) of gene 
transfer. HGT is facilitated by donor and recipient having a shared 
environment 25 as exemplified by the array of GT6-positive bacterial 
species that are symbionts or pathogens of various vertebrates and 
the many GT6 sequences in the Marine Metagenome that are 
probably from uncharacterized cyanophage or cyanobacteria. P. 
acanthamoebae is an endosymbiont that can infect humans as well 
as acanthamoebae, and the simpler structure of PaGT6 and other 
group (2) members identifies them as models for the ancestor of the 
GT6 family that have characteristics expected in vectors for HGT. To 
investigate its functional properties we cloned and expressed PaGT6 
as a soluble His-tagged Sumo fusion construct. The protein, purified 




Figure 5 | Catalytic site ofBoGT6a showing the bound acceptor molecule 
FAL (magenta) and modeled UDP-Gal (yellow) as ball-and-stick model. 

Interacting residues are shown as ball-and-stick model and coloured in 
green. 



by Ni 2+ -chelate column chromatography migrated as a single band 
with an apparent molecular weight of 40 kDa, consistent with the 
size expected from it sequence (PaGT6 27 kDa plus the Sumo-His 
tag of 12 kDa). 

As in previous studies 14 glycosyltransferase activity measurements 
were conducted using a radiochemical assay with UDP-[ 3 H]-Gal and 
UDP-[3H]-GalNAc as potential donor substrates and H-antigen tri- 
saccharide (2'-fucosyl-N-acetyl lactosamine), FAL and N-acetyl lac- 
tosamine as possible acceptor substrates in the presence and absence 
of Mn 2+ or Mg 2 *. These showed that PaGT6 catalyzes the transfer of 
GalNAc from UDP-GalNAc to H-antigen trisaccharide or FAL in the 
presence of either Mn 2+ or Mg 2 *, with Mg 2 * giving higher activity. It 
was inactive in the absence of metals and with LacNAc as acceptor 
substrate. A detailed study of its properties will be reported elsewhere. 

Discussion 

The metal-independent (bacterial) and metal-dependent (vertebrate, 
phage and Parachlamydia) members of GT6 provide an example of 
functional divergence in closely similar proteins. The GT6 family 
consists of retaining GTs that catalyze a reaction in which the ot- 
configuration of the transferred carbohydrate in the donor substrate 
is also present in the product. Based on structural and other evidence 
it appears that GTs of this type employ a S^i mechanism involving 
the formation of a short-lived carbocation transition state in which 
the incoming nucleophile (acceptor) is activated by the UDP leaving 
group 26 ; this contrasts with the double displacement mechanism 



Table 4 Potential hydrogen bond interactions 
and BoGT6a in their modeled complex 

Protein Atom (BoGT6a) Ligand Atom (UDP-Gal) 


between UDP-Gal 
Distance (A) 


Ile8 O 


02 


3.45 


ThrlOO 


N3 


3.23 


ThrlON 


02 


2.84 


Tyrl 3 OH 


02A 


2.97 


Asn69 ND2 


04 


3.02 




N3 


3.27 


Arg73 NH2 


03' 


2.48 


Asn95 OD1 


03D 


2.85 




OlA 


2.61 


Asn95 ND2 


03' 


3.09 




02' 


2.91 


Ala96 N 


03D 


2.72 


Glyl56 0 


03' 


3.21 


Glyl 57 N 


02' 


3.19 


Hisl90NE2 


06' 


2.99 


Aspl91 OD2 


04' 


3.41 


Aspl91 OD1 


04' 


2.54 


Glul92 N 


04' 


3.07 


K231 NZ 


OlA 


3.35 




02A 


3.41 
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utilized by retaining glycosidases, enzymes that catalyze an analog- 
ous reaction. In the better-characterized metal-dependent enzymes 
the metal ion functions as a Lewis acid catalyst, facilitating the depar- 
ture of the UDP leaving group, but also functions in donor substrate 
binding and may have a role in moving the departing group in the 
active site during catalysis 26,27 . 

In general, GT-A fold GTs from different families are metal-depend- 
ent and have degenerate metal binding motifs similar to D-X-D. The 
exceptions are some sialyltransferases (STs) and the retaining GT, core 
2 P-l,6-N-acetylglucosaminyltransferase (C2GnT;C2GnT-L), a mem- 
ber of GT family 14 28 . STs are structurally heterogeneous and include 
members of five different GT families that have both GT-A and GT- 
B folds 29 ; they are all metal-independent but their donor substrates 
differ in being a monophosphate derivative (a CMP-sialic acid) as 
opposed to diphosphate (e.g. UDP) 29 . Crystallographic studies of 
complexes of core 2 P-l,6-N-acetylglucosaminyltransferase with 
substrates together with mutational studies suggest that basic resi- 
dues close to the C-terminus of the catalytic domain function in 
donor substrate binding and catalysis and may have a similar role 
to the metal ion in metal-dependent GTA-fold GTs 28 . Together, 
these GTs reinforce the perspective that GTs with GTA folds can 
be metal-dependent or metal-independent, however, the GT6 family 
differs from other GTA fold families in encompassing both metal- 
dependent and metal-independent forms. 

The close similarity between the active site structure and sub- 
strate interactions of BoGT6a and the metal-dependent mammalian 
GT6 members suggests that they have similar catalytic mechanisms, 
generating uncertainty regarding the role of the metal ion in cata- 
lysis. There is an obvious link between metal binding (and metal 
dependence) and the D-X-D sequence. Mutational data are consist- 
ent with the general features of our model of the UDP-GalNAc 
complex with BoGT6a, since mutation of Asn95 of the N-A-N motif 
to Asp, disrupted donor substrate binding and greatly reduced fc cat . 
Metal-independence appears to be linked, at least in part, to the 
replacement of the aspartates of the D-X-D motif in the metal- 
dependent GT6s, by asparagine, eliminating charge repulsion 
between the phosphates of the donor substrate and the aspartyl side 
chain carboxyls. In the modeled BoGT6a complex with UDP- 
GalNAc, the amide of Asn95 H-bonds with 02 of the a-phosphate, 
in keeping with the effects of mutating this residue to Asp. Our 
mutational studies have shown substitution of Ala for Lys231, of 
the C-terminus of BoGT6a reduces k cat by 26-fold, whereas the same 
substitution for Arg229, produces only a 3-fold loss. This supports 
the interaction of Lys231 with the donor substrate as shown in 
Fig. 5. Thus, in relation to substrate binding, interactions of the 
phosphates of the donor substrate with the amide moiety of 
Asn95 may replace those with the metal ion. The role of the metal 
ion in stabilizing the UDP leaving group could be replaced by inter- 
actions with basic residues near the C-terminus, particularly Lys231. 
The existence of homologous metal-independent and metal- 
dependent groups raises the question whether metal dependence 
and independence has some adaptational significance. There are 
interesting parallels between the GT6 family and 3-deoxy-D- 
manno-octulosonate-8-phosphate synthase (KD08PS), a bacterial 
enzyme that catalyzes the synthesis of a precursor of the endoto- 
xin of Gram-negative bacteria. There are two subdivisions of the 
KD08PS family that are dependent on a divalent metal (Zn 2+ or 
Fe 2+ ) for activity or are metal-independent. The metal-dependent 
forms bind the metal ion through the side chains of a cysteine, a 
histidine and a glutamate, whereas in the metal-independent 
enzymes, the cysteine is replaced by asparagine 30 . It is interesting 
that metal- dependence and metal-independence can be conferred 
by substituting Asn for the metal-binding Cys and vice versa, 
although full catalytic activity can require up to three additional 
substitutions 3133 . It seems possible that in GT6, as well as 
KD08PS, the metal ion may function only in substrate binding. 



In summary, BoGT6a is surprisingly similar to human GTA in 
structure, and also utilizes precisely equivalent residues for binding 
substrates, despite a major functional divergence from vertebrate 
GT6s in having metal-independent catalytic activity. The results of 
molecular phylogeny analyses indicate that a metal-dependent pro- 
karyotic GT6, typified by those of P. acanthamoebae GT6 and cya- 
nophage PSSM-2, was the ancestor of the vertebrate clade through 
horizontal gene transfer 1 . BoGT6a and its homologues in other bac- 
terial symbionts and pathogens in humans catalyze the synthesis of 
histo-blood group antigens on the bacterial surface that potentially 
contribute to the generation of antibodies against non-self histo- 
blood groups. The GT6 gene products in bacteria and vertebrates 
have acquired some structural adaptations required for the ex- 
pression of their biological functions. One is the attachment to 
membranes, through the acquisition of a C-terminal "BHB" mem- 
brane-association domain and N-terminal membrane-insertion and 
stem domains, respectively. Also, the metal-independence of the 
bacterial enzymes may be an adaptation to low availability of divalent 
metals in the cytosol of Gram-negative anaerobes. 

Methods 

Protein expression and purification. The N-terminally His-tagged form of BoGT6a 
was expressed in E. coli BL21(DE3) cells and purified as previously reported 14 The 
purified protein was dialyzed against 20 mM Tris-HCl, pH 7.9, containing 0.1 M 
NaCl and 2 mM dithiothreitol (DTT); 10 mM EDTA was added to the buffer for 
storage. 

P. acanthamoebae genomic DNA was utilized as a template for PCR using primers 
designed to amplify the GT6 gene with additional sequences on the 5 'and 3' end for 
cloning into the Expresso T7 SUMO peTite vector (Lucigen). 

Forward: 5 ' -CGCGAACAGATTGGAGGT TTGTTATTTAGCCACTCTCT- 
TTATG-3' 

Reverse: 5 1 -GTGGCGGCCGCTCTATTA TTATCCCTTTCGCATTTCTT- 
CATG-3' 

Cloning and transformation into BL21 (DE3) E. coli cells was performed following 
the instructions of the supplier. BL21 cells were grown to OD 600 of 0.5-1.0 and 
induced with 10 mM IPTG. Soluble protein was isolated and SUMO-HIS-GT6 
protein was purified by Ni 2+ chelate chromatography using Ni 2+ -NTA Super Resin 
(Qiagen) with elution by 150 mM Imidazole. The purified protein (2 mg/litre of 
culture) was dialyzed against 20 mM Tris, 0.2 M NaCl pH 7.5. 

Crystallization, X-ray data collection and processing. Native BoGT6a was 
concentrated to 6-8 mg/ml prior to crystallization. Both the native and FAL complex 
were crystallized using the sitting drop vapor diffusion method with a Phoenix 
crystallization robot on a 96 well Intelli-plate (Art Robbins Instrument). The drops 
were set up at a 1 : 1 ratio of protein to mother liquor and incubated at 16' C. Single 
crystals of native (apo-protein) BoGT6a grew in well solution containing 0.1 M Tris, 
pH 7.5, 0.2 M calcium acetate and 15% (w/v) PEG 4000 of crystallization screen Clear 
Strategy Screen 2 (Molecular Dimensions Ltd., UK). To form the BoGT6a-FAL 
complex, BoGT6a (4 mg/ml) and FAL (10 mM) were incubated overnight at 16°C 
and crystals were grown in a well solution containing 20% PEG 3350, 0.1 M Bis-Tris 
propane, pH 8.5 and 0.2 M Na/K tartrate of crystallization screen Pact Premier 
(Molecular Dimensions Ltd., UK). 

Diffraction datasets for native BoGT6a (to 1.91 A) and the BoGT6a/FAL complex 
(to 3 A) were recorded at Diamond Light Source (Didcot, Oxon-UK) on station 102 at 
100 K. Cryo cooling was achieved by stabilizing the crystals (prior to X-ray data 
collection) in 25% PEG 4000 and 30% PEG 3350 for the native and FAL complex 
crystals respectively. Even though the native crystals diffracted to 1.7 A resolution, 
because of the long c-axis the data were recorded using thin oscillation width (A<1>) of 
0.1° and 0.07°. The crystal was stable in the X-ray beam and over 1000 images were 
recorded to obtain a complete dataset at high resolution. The native dataset was 
processed using HKL2000 suite 34 in primitive tetragonal space group (P4 3 22, a — b — 
41.2 A, c — 282.9 A), while the dataset for the BoGT6a-FAL complex was processed 
using iMosflm 35 in primitive monoclinic space group {¥2^, a — 70.7 A , b — 93.6 A, 
c = 75.4 A, B = 93.9°) (Table 1). 

Structure determination. Initial molecular replacement trials of structural solutions 
were carried out with PHASER 36 , MOLREP 37 and AMoRe 38 software suites from the 
CCP4 package 39 using a BoGT6a homology model built using GTA (PDB: 1WSZ 3 ) 
and ot-l,3-galactosyltransferase (a3GT) (PDB: 2JC0 9 ) as templates, but these failed in 
all crystal systems. 

Further analysis of the data, processed in a primitive triclinic system using the 
Pointless Suite 40 , suggested that the crystal form might be primitive tetragonal. 
Automated model building and refinement trials were performed, for the data 
processed in primitive tetragonal space group, using MrBump 41 , which uses 
CHAINSAW 42 and/or MOLREP 37 as the model editing tool. The solutions were 
ranked based on R tree values, which indicated that the possible space group is either 
VA{1{1 or P4 3 22. These refined models were selected for input to PHASER 36 , which 
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gave the best molecular replacement solution with higher log-likelihood gain values 
for space group P4 3 22 with one molecule per asymmetric unit. However, all the 
molecular replacement solutions froze at a crystallographic R-factor (i? cryst ) value of 
—40% during refinement. 

A careful analysis of the structure- based sequence alignment of the template 
against the preliminary refined model indicated that certain loops had higher thermal 
values that could affect the refinement process. These loops which spanned residues 
65-72, 112-116, 124-156, 181-188 and219-226 were removed from the model. The 
edited model was fed into ArpWarp classic 43 for automated loop building and fitting 
into electron density map of 2 chains with a total of 206 residues. After several cycles 
of ArpWarp classic run 43 and restrained refinement the R cryst and Rf rcc values dropped 
to 0.235 and 0.303 respectively. Several additional cycles of manual model building 
and refinement were carried out using COOT 44 and PHENIX software 45 suites 
respectively {Table 1). 

Phases for the BoGT6a-FAL complex data were calculated by the molecular 
replacement method (using the fully refined native BoGT6a structure as the starting 
model) in space group P2! with four molecules in the asymmetric unit. Clear electron 
density was observed for the acceptor substrate FAL and inserted in to the structure. 
Further refinement and model building were carried out using PHENIX software 45 
suite and COOT 44 (Table 1). In spite of moderate resolution, the structure is highly 
ordered and the information about the structural changes due to acceptor molecule 
recognition is quite evident in all four copies of the asymmetric unit. 

Isothermal titration calorimetry (ITC). Solutions of wild-type BoGT6a and its 
E192Q mutant 14 (12-30 (jJVI) were dialyzed extensively against HEPES buffer 
(20 mM), pH 7.5 containing 250 mM NaCl and 1 mM DTT and degassed prior. 
Isothermal titrations were performed at 25 °C with ligands dissolved in the same 
buffer, UDP-GalNAc (3 mM), and FAL in the presence or absence of UDP (3 mM) 
using a MicroCal VP-ITC micro calorimeter. During each titration, 16 aliquots (10- 
20 ul) of ligand solution were injected, each over 16 sec, spaced at 300 sec intervals. 
The stirring speed was 200 rpm. Heats of binding were determined by integrating the 
signal from the calorimeter, and binding isotherms were generated by plotting the 
heats of binding against the ratio of enzyme to inhibitor. The software package Origin 
5.0 from Microcal Inc. was used to calculate the association constant (KJ enthalpy 
change (AH) and stoichiometry (N). This analysis presumes that the parameter c 
(— [M T ]/Kd, where [MT] is the concentration of enzyme) is between 1 and 1000 20 . 
This condition was not attained for the interactions investigated here (c < 0.35), as in 
previous studies with bovine a3GT 20 , because of the limitations of enzyme solubility 
under these experimental conditions. However, it was possible to increase the 
concentration of ligand sufficiently to saturate the enzyme. Also, a value of 1.0 for N is 
supported by structural data for FAL and is reasonable to assume for UDP-GalNAc 
based on structural data from homologous glycosyltransferases. The free energy of 
binding (AG) and entropy contribution to binding (TAS) was calculated from the 
relationships AG= — RTLn(K a ) and TAS = AH -AG respectively. 

Glycosyltransferase assays. Glycosyltransferase activities of PaGT6 (2 jig) were 
measured using 3 H-UDP-GalNAc as the donor substrate in radioassays as previously 
described 14 with the following potential acceptor substrates- a) 1 mM FAL (Accurate 
Chemical), b) 1 mM blood group H type II trisaccharide (V-labs) and c) 3 mM N- 
acetyl-lactosamine. 
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